arXiv: 1501.04514vl [math.NA] 19 Jan 2015 


COMPRESSIVE SPACE-TIME GALERKIN DISCRETIZATIONS OF 
PARABOLIC PARTIAL DIFFERENTIAL EQUATIONS 

STIG LARSSON AND CHRISTOPH SCHWAB 


Abstract. We study linear parabolic initial-value problems in a space-time variational for¬ 
mulation based on fractional calculus. This formulation uses “time derivatives of order one 
half” on the bi-in£nite time axis. We show that for linear, parabolic initial-boundary value 
problems on (0, oo), the corresponding bilinear form admits an inf-sup condition with sparse 
tensor product trial and test function spaces. We deduce optimality of compressive, space- 
time Galerkin discretizations, where stability of Galerkin approximations is implied by the 
well-posedness of the parabolic operator equation. The variational setting adopted here ad¬ 
mits more general Riesz bases than previous work; in particular, no stability in negative 
order Sobolev spaces on the spatial or temporal domains is required of the Riesz bases ac¬ 
commodated by the present formulation. The trial and test spaces are based on Sobolev 
spaces of equal order 1/2 with respect to the temporal variable. Sparse tensor products of 
multi-level decompositions of the spatial and temporal spaces in Galerkin discretizations lead 
to large, non-symmetric linear systems of equations. We prove that their condition numbers 
are uniformly bounded with respect to the discretization level. In terms of the total number 
of degrees of freedom, the convergence orders equal, up to logarithmic terms, those of best 
N-term approximations of solutions of the corresponding elliptic problems. 
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1. Introduction 

For a bounded linear and self-adjoint operator A G C(V,V*) in an evolution triplet V C 
H ~ H* C V*, and a bounded domain D C MA, we consider the initial boundary value 
problem for abstract, linear parabolic evolution equations 

(1.1) Bu := dtu + Au = f in ]R> = (0,oo), 
with homogeneous initial condition 

(1.2) ti(0) = 0. 

In (jl.ip . we think of A as linear, strongly elliptic (pseudo)differential operator of order 2m > 
0, and of y as a closed subspace of H^{D) supporting homogeneous, essential boundary 
conditions of the initial boundary value problem ()l.ll) , ()1.2p . 

Optimality of adaptive variational space-time Galerkin discretizations of (H]), dri on 
(0, T) for T < oo were shown for the first time in |20] . There, well-posedness of suitable space- 
time variational saddle-point formulations of the parabolic initial boundary value problems 
0, (na) were established. By means of tensorized Riesz bases of the Bochner spaces 
which underlie the space-time variational formulations, the parabolic initial boundary value 
problems were converted to equivalent bi-infinite matrix problems. These matrix problems 
were subsequently solved numerically, in optimal complexity, by means of adaptive wavelet 
discretizations from [6]. We note that adaptive wavelet techniques from [6] were essential 
in the algorithms in [20] . since it used the paradigm “stability by adaptivity” from [6]. In 
particular in [20] . no stability result for nonadaptive discretizations could be obtained, but 
rather followed from the well-posedness of the infinite-dimensional problem, the Riesz basis 
property and certain optimality properties of the adaptive Galerkin discretizations (“stability 
by adaptivity”). 

In the present paper, building on fractional calculus techniques pioneered in variational 
formulations of parabolic initial boundary value problems by M. Fontes IIOIIII] . we propose a 
space-time variational formulation based on bilinear forms, which are, unlike the formulations 
considered in [20], “symmetric” in the sense that trial and test spaces, which arise in the 
variational formulation, are Sobolev spaces of equal orders with respect to time differentiation. 
Stability (in the sense that a discrete inf-sup condition holds) of our space-time Galerkin 
discretization requires that the finite-dimensional trial and test spaces are different. 

The presently considered space-time variational formulation admits a unique variational 
solution in a Bochner space X, which is intermediate to the solution spaces which are obtained 
by the “classical” approach. Moreover, as shown by M. Fontes in mm, the presently 
considered solutions can be obtained by monotone operator methods and, therefore, Galerkin 
approximations are well-defined and stable with any closed subspaces, including in particular 
sparse tensor products of multilevel hierarchies in space and time. It is interesting to note 
that time derivatives of order 1/2 were used already in [21 [T7] in order to prove error estimates 
in the X-norm for hnite element approximations of (ll.ll) - (ll.2l) . 

As in m, we establish in the present paper quasi-optimality of linear and nonlinear space- 
time adaptive and compressive Galerkin discretizations in the space-time cylinder. To this 
end, we show a discrete inf-sup condition in the present paper, for a suitable sparse tensor 
space-time Petrov-Galerkin discretization. The use of wavelet-type Riesz bases in space and 
time then results in uniformly bounded condition numbers of the hnite-dimensional prob¬ 
lems; notably, this holds without the Riesz basis property in V* of the spatial wavelet basis 
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S, which was essential in [20] . In the presently considered variational formulation, we con¬ 
sider in particular long-term evolution, i.e., the time interval (0,T) with T = oo, and analyze 
space-time compressive and adaptive numerical approximation of long-time integration for 
these problems. Unlike mm, we obtain stability, multilevel preconditioning and space-time 
compressibility even without adaptivity, and with trial and test spaces of equal dimension 
(albeit being possibly different so that we consider a Petrov-Galerkin formulation as in [T]). 
Moreover, the optimality results in Section [S] entail optimal, adaptive and space-time com¬ 
pressive methods for long-time integration (i.e., T = oo) for parabolic evolution problems. 

The outline of this paper is as follows: in Section [21 we present basic definitions and 
facts from functional analysis and fractional calculus. In Section [3l we present the space- 
time variational formulation of (H]), (1121). Section [2 we consider compressive space-time 
Galerkin discretization with sparse tensor subspaces. Section [5] addresses the space-time 
adaptive discretization of the variational formulation in Section [3] and establishes optimality. 
The analysis in Sections [SHS] is developed for long-time integration, i.e., for T = oo. 


2. Preliminaries 


2.1. Functional analysis. We require some tools from functional analysis. Throughout this 
paper all vector spaces are real unless explicitly stated otherwise. Consider two Banach spaces 
X and Y and a bilinear form B: X xY ^ M, which is bounded, i.e., there exists a constant 
C such that 


(2.1) \B{w,v)\ < C'll'wllx ||t’||y Vrc € X , v €Y . 

We are interested in solving the linear, variational problem: for each F S Y*, find a unique 
u G X such that 


(2.2) B{u,v) = F{v) VueU. 

The form B{-, •) induces in a one-to-one fashion a bounded, linear operator B G C{X, Y*) via 

Y* {Bw,v)y = B{w,v) Vu) G X , V €Y , 

so that the unique solvability of (|2.2p is related to the question of bounded invertibility of 
the operator B G £{X,Y*). There holds: 


Proposition 2.1. Let X,Y be Banach spaces; Y reflexive. Let B: X xY 
bilinear form and consider the inf-sup condition: 

B{w, v) 


he a bounded 


(2.3) inf sup 

o^weXQ^y^Y ll'it'l|x||'y||Y 

and the (adjoint) injectivity condition: 


> 7 > 0 , 


(2.4) sup B{w, u) > 0 VO 7 ^ u G y . 

w€X 

The conditions (1221)-dzai) hold if and only if for each F £Y*, the variational problem (1221) 
admits a unique solution u G X and in this case there holds the estimate 

7 

In other words, (1221)-(1221) hold if and only if the corresponding operator B G /1{X,Y*) is 
boundedly invertible, in which case x) < ■ 
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The Proposition 12.11 was used in [20] in verifying that space-time saddle point formulations 
of (11.11) are well-posed. Below, we shall be interested in the following special case where 

X = y. 

Corollary 2.2. Assume that X is a reflexive Banach space, and that the bounded bilinear 
form B: X X X ^ M is coercive-equivalent, i.e., there exists an isomorphism S G C(X,X) 
such that B{-,S-) is coercive, i.e., there exists c > 0 such that 

(2.5) B{uj, Svu) > c\\tv\\'j(; Mw G X . 

Then the corresponding operator B G C{X,X*) is boundedly invertible. 

Proof. We assume (|2.5p and verify conditions (|2..Sp - ()2.4p in Proposition 12.11 For 0 w G X, 
we have <C 5 ||r(;||jis: and Sw 0, since S is an isomorphism. Together with (12.5p this 

leads to 

B{w,v) ^ B{w,Sw) ^ \\wfx ^ c I, „ 

sup -Tj-w— > II „ II — > c > — pw X ■ 

o^vGX ItIIx Ipuiilx |pw||x cs 
This proves (12.3D . To verify (12.4D we compute 

sup B{w, v) > B{S~^v, v) = B{S~^v, S{S~^v)) > c||S'“^u||^ > -yI|u||^ > 0 

wGX Cg 

ioY 0 ^ V E X. □ 

2.2. The elliptic operator. We let {H, ■) h ) and {V, (•, •)v") denote two separable Hilbert 

spaces with dense embedding V C H and duals H* and V*. We identify H ~ H* according 
to the Riesz representation theorem and obtain the Gel’fand triple 

V CH cxH* dV* , 

again with dense injections. Let A G C{y,V*) he a. bounded self-adjoint linear operator such 
that the corresponding bilinear form a{v, w) = y* {Av, w)y is coercive and bounded onV xV, 
i.e., for some 0 < A_ < A+ < oo, 

(2.6) a{v,v) > A_||u||y , 10 ( 1 ;,rc)| < A+||u||y||t(;||y . 

Example 2.3. In a bounded Lipschitz domain D C M”’ of dimension n > 1, we consider the 
linear, second order divergence form operator given for v G C^{D) by 

Av = —V ■ (a(x)Vu) -|- c{x)v . 

Here, a G (L°°(D))g^ and c G L°°(D) satisfy the ellipticity conditions 

37 > 0 G M"’ : Eo'C > ! ess inf c(x) > 0 . 

xGD 

In this case V = Hq{D), H = I?‘{D), a{v,w) = {aX/v,Vw) + {cv,w), and ()2.6D is valid. 
Example 2.4. With D as in Examvle \2.tA we consider the Stokes equation. Then 
H = {v G lfl{D)'^ : divu = 0 in L^{D) , 7 o(u • n) = 0 in H~^{dD)} , 

V = {vG Hl{DY : divu = 0 in L^{D)] , 

where 70 denotes the trace operator and the bilinear form is given by a{w, v) = Xw : Vv dx. 
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2.3. Bochner spaces. We require Bochner spaces of vector-valued functions defined on in¬ 
tervals. For an interval /, a Banach space X with norm || • ||x, and for 1 <p < oo, we denote 
by LP{I]X) the space of strongly measurable functions u \ I ^ X such that 


f||LP(7;A:) - 


\u{t)r 


\ 1/p 

< oo 


for 1 < p < oo with the usual modification for p = oo. Similarly, we denote by H^{I]X) 
the space of functions whose distributional time derivative belongs to Lp‘{I]X). We also 
need spaces of continuous functions: for k € Nq, we denote by C^{I\X) the Banach space 
of fe-times continuously differentiable and bounded mappings u: I ^ X endowed with the 
standard norm || • Wc^i^p^x)- 


2.4. Interpolation spaces. We repeatedly use assorted facts from the theory of function 
space interpolation (see, e.g., B Ha ES]). In particular, we use the interpolation spaces 
< s < 1, between two Hilbert spaces with dense embedding X C Y, as dehned, for 
example, in [181 Chap. 1, Def. 2.1]. 

For 0 < T < oo we denote by / = (—T, T) the symmetric interval, with / = R implied if 
T = oo, and set I> = I n {t > 0}. For a separable Hilbert space H, we define 

(/>;//) := {v € H) : u(0) = 0} . 

By the continuity of the embedding C H), the set FTq Ff) is the 

null space of the trace operator at t = 0 and, therefore, a norm-closed, linear subspace of 
H). We introduce the interpolation spaces 

H) := H), H)], , s € (0,1) , 

:= H), H)], , s G (0,1) , 

:= H), H)]^ , s G (0,1) \{i} , 

:= [L2(/>;FF),FF1{o}U>;^)]i • 

Remark 2.5. With /> = (0, T) for 0 < T < oo there holds: 

(1) Consider the interpolation spaces [L^(/>; FF), FF^ |p|(/>; FF)]^ for 0 < s < 1, s ^ For 

0 < s < i, holds that [L2(F>;FF),FFo\^oj(F>;FF)], = H), H)]„ 

i.e., the homogeneous boundary condition at {0} is “not seen” by the interpolation space, 
whereas for ^ < s < 1 we have that 

[l2(F>;FF),FFo\^o}(F>;FF)], = H) C [L\ly, H), H^Iy, H)]s = 

is a subspace which is norm-closed in H), [THl Chap. 1, Remarque 11.3]. 

1 

(2) The space |q|(F>; H), which will be important in the present paper, is strictly included 

in FF 2 (F>;FF) = [L^(F>; FF), FF^(F>; FF)] i with a topology which is strictly finer than that of 

dHl Chap. 1, Thm. 11.7]. 

1 1 

(3) FFqq |g|(F>; FF) is not closed in the norm of It is a dense subspace / |16[ 

1 1 

Theorem 1.4.2.4] with p = 2) and the embedding FF^q |qj(F>; FF) C FF 2 (F>;FF) is continuous, 
m Lemma 4.8]. 
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The following intrinsic characterizations of the spaces of order ^ will be useful. We refer 
to m Chap. 1], in particular, for the first one Theoreme 9.1 and (10.23) in Section 10.3, and 
for the second one, Theoreme 11.7 and Remarque 11.4. 


Proposition 2.6. Let I = (—T, T), /> = (0,T) forT € (0, oo]. 

(1) The interpolation space H) consists of all u G which are equal to the 

restriction to /> of some u G H^{I]H). The interpolation norm of H^{Iy]H) is equivalent 
to the intrinsic norm || • || i given by 


(2.7) 


hi 








s — t] 


(2) The interpolation space (/>; H) consists of all u G H 2 (/>; H) such that the function 

s h-)■ s“ 2 tt(s) belongs to L?‘{Iy]H) with intrinsic norm || • || 1 given by 


H, 




( 2 . 8 ) 


1 


^ 00 , 10 } 






s — t\ 


'!> 


The constants implied by the norm equivalences are independent of T ^ (0,oo]. 


2.5. Fractional calcnlns on the half line. To render our presentation self-contained, we 
recapitulate here fractional calculus from m as necessary by our subsequent analysis. 

For (p G L^(K>;C), a G (0,1), the Riemann-Liouville fractional integrals [191 Def. 2.1] are 

{l+(p)it) = [ {t- s)“"V(s) ds , t G M> , 

r(a) Jo 

(I“(/))(t) =-— / (s - t)“"V('S) ds , tGM>. 

r(a) Jt 

Then we have integration by parts [191 (2.20) and Corollary to Theorem 3.5 p. 67]: 

(2.9) [ (l’f'ip){t)^{t)dt= [ ijj{t)(lf(p){t)dt 
and the semigroup property [T9l (2.21)]: 

(2.10) = rff) , a,^>0. 

The proofs of (12.91) . (12.101) are elementary calculations with integrals. 

By D we denote the time derivative of order 1 and we define time derivatives of fractional 
order a G (0,1) for u G C“(M), 

(2.11) (D+u)(t) := (DI^"“u)(t) = —^--D [ {t - s)""u(s) ds , 

r(l-a) Jq 

1 r°° 

(2.12) (D-w)(i) := -(Dllr“u)(t) = ----D / (s - t)“"u(s) ds . 

r(i-a) Jt 

We require a space of test functions, which is closed under the action of D“ and D"; to this 
end we introduce (cp. [T^ l 

J'(M;C) := {u G C'°°(M;C) : ||u11h'>(R;C) <00 Vs G M} . 
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The set J^(M; C) is a Frechet space with respect to the topology induced by the family of norms 
{|| ■ 11//“(«;€)}*£« we have the dense embeddings P(M;C) C 5(M;C) C J^(M;C) C £^(M;C) 
(where T>, S, and £ are the classical test function spaces). We observe that the definitions 
(12.111) . (|2.12p remain meaningful for u G C). We further define test function spaces 

C) = {u G C) : 3u G J^(]R; C) such that u = } 

and, with Eq the “extension by zero” operator, 

Jo(K>;C) = {u€ C“(M>;C) : Equ G J'(M;C)} . 

The subspaces /o(I^>;C) C |qj(]R>; C), J^(M>;C) C iF 2 (M^;C) are dense, see [T^ 
Lemma 3.7]. 

We denote the corresponding spaces of distributions by J'q(]R>;C) = J^(R>;C)* and 
J^'(M>;C) = Jo(lR>;C)*. Then it follows that, see [121 (2-24)-(2.29)], 

D“: Jo(K>;C) ^ Jo(K>;C) , D": 7'(M>; C) ^ C) , 

D“ ; J'^(M>;C) ^ J'^(M>;C) , D“ : 7-'(R>;C) ^7''(M>;C) . 

Here the Eq (M> ; C) distribution derivative D“ means 

(D"u, (/.)=/ uD"0dt V()> G J^(M>;C) , 

J M.> 

and the J^^(M>;C) distribution derivative D“ means 


(D"u,0)=/ uBl^dt V(/> G J'o(K>;C) . 

We can now prove a relevant integration by parts formula. 

1 

Lemma 2.7. The J'q(M>;C) distribution derivative Drc of w G FF^q |q^(M>; C) satisfies 
(2.13) (Du;,?;) = f T)^wDfvdt Vu G J-'(M>; C) . 

J ]R.> 

Proof. By dehnition we have 

(Dr(;,u) = / re (—Du) dt = (re, —Du) Vu G J-'(M>; C) . 

1 

Since J^o(l^>;C) C FF^q |q|(M>; C) is dense, it suffices to show 

{w, —Du) = (D^ru,Diu) \/w G Jo(I^>;C) , u G F^(M>;C) . 

1 1 

If ru G J'o(I^>;C), then u;(0) = 0, so that w = = Iflf.ijj, where if = Dru. Similarly, If 

1 1 

u G J^(M>;C), then u(oo) = 0, so that u = V_(j) = PP0, where = —Du. Therefore, by 
integration by parts (12.9p . 

(u;,-Du) = = {llfifilfi) = (D|u;,Diu) . 

□ 
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2.6. Extension by zero. In the previous Subsection 12.41 the space H) is charac¬ 

terized by means of an intrinsic norm. Here we give an alternative characterization in terms 
of extension by zero, denoted Eq. 


>; H) is equals {w € H ‘2 (M>; H) : Eqw £ (M; H)}. 


Proposition 2.8. The space |q| 

1 

A norm on H), which is equivalent to (|2.8p . is given by HEq • || i ; more 

uu,\Uj- 

1 . 

precisely, for every w G H) there holds 

(2.14) 


ikf 1 


< \\Eow\f 1 < 2||r(;|| i 


2 

Proof. For a proof of the first part, we refer to [121 Lemma 3.5]. It remains to show (I2.14p . 

1 1 

Consider an arbitrary w G H). Then w = Eqw G H 2 (]R;H) and = 

■If) = \\Eow\\L 2 (^.^jf). We next compute with the seminorm defined in (I2.18p below: 


w 


\Eow\‘^ 1 = 1 


-ff 

J —OO j —CO 

w{t) - w{t')\\‘jj 
>0 Jo 1 ^ “ 

'\wit) - w{t')rH 


— w{t' 






dt dt' 


roo roo 

Jo Jo 

roo r 

Jo Jo 


\t — t'p 

By comparison with the seminorm part of the norm 


J t'=—oo J t=0 

' + 2 f 

Jo 


!”(*)-“(*')ll«dtdt' 




w{t)\W 

t 


dt 


, we conclude that 


kl" 1 

h:^ 


-‘oo,{o} 

and the proof is complete. 


< \Eow\ 




< 2\w\ 


H, 


00 ,{ 0 }'' 


.■,H) 


□ 


2.7. Further characterizations. The preceding function spaces are intimately connected 

\ 1 

to the fractional derivatives and on J^(M>;C). As these derivatives are essential in 

the proposed space-time formulation, we discuss their properties in detail. By continuity, 

1 1 

the operators T)f extend to bounded operators from H 2 (M>; C) to C). The following 

1 

proposition collects several properties of D^. 


Proposition 2.9. Let H denote an arbitrary Hilbert space over M. Then there holds 

1 

(1) A function u G belongs to H) if and only if its Jq(M>;77)- 

1 

derivative Idfu G L^(M>;77). 

(2) A function u G L^(M>;77) belongs to if and only if its E'H)-derivative 

V)lu G L2(M>;H). 

1 . 

(3) A norm on H), equivalent to the norm (12.81) . is given by 


W\? 1 




.\H) 


L2(K>;H) 




(2.15) 


2 


2 
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A norm on equivalent to the norm (12.7D . is given by 


(2.16) 








Moreover, for every u € (M; H) there holds 




+ IlDiu 




= Kt 




+ IlDiul 


L2(R;i^) 


iDiu 


-/ / 

JseM. JtG 


!“(“)-“Will'd,* 


(2.17) 

(2.18) ^I _ |.^i . . . It, +1 

Is&RJt&R P “I 

For the proof of ()2.15p . (I2.16p we refer to [T2l Lemmas 3.5, 3.8] and to [T2l Lemmas 3.6, 

_ 1 

3.9], respectively. The identity (I2.17K is immediate from the Fourier characterizations of 
in m Sect. 3]. For (I2.18p . we refer to [HI (4.13)]. We remark that the expression | • | i 

introduced in (12.181) is indeed a seminorm, as it vanishes on all functions u a H independent 
of t. 

In view of Lemma 12.71 and Proposition 12.91 it is now clear that the bilinear form (Drc, v) is 
bounded on |q|(]R>; H) x iLz (M>; H). 

2.8. Coercivity over M. A key ingredient in the theory of Fontes is that the time derivative 
is coercive in the sense of Corollary 12.21 for functions defined on M. We demonstrate this here 
by considering the operator, with A as in Subsection 12.21 

Bv = T)v + Av , V € J'(M; V) . 

1 

By fractional integration by parts (immediate from the Fourier characterizations of D^), we 
find 

(2.19) {Bw,v) = / ^(D|.r(;, DEu)j:/+ a(rc, u)^ dt , re, u G J^(M; 17) . 

J M. 

We also define the operator 

H" ;= cos(7ra)I + sin(7ra)H , a G M , 

where H is the Hilbert transform acting with respect to the t-variable. By using (12.6p . we 
then obtain the fundamental coercivity inequality, for any w G J^(M; V) 

{Bw, = (Drc + Aw, cos{7ra)w — sin(7ra)Hr(;) 

1 i_ 

= cos(7ra)(Dr(;, tc) — sin(7ra)(D^r(;, D^Htc) 

+ / ( cos(7ra)a(tc, re) — sin(7ra)a(r(;, Hu;)) dt 

Jr 

> sin(7ra)||D = u;||^ 2 (R.H) + (A_cos(7ra) - A+sin(7ra)) ]|u;]|^ 2 (R.y) , 


because (Du;,u;) = 0, ||Hu;|| 2 , 2 (R.y) < ||u^||L 2 (R.y), and Df^H = —Df^H 2 = —D^, see 

Fixing the parameter a > 0 sufficiently small, by density of J^(]R; V) in iL 2 (M; H) n L^(M; V), 
and (12.171) . we find the coercivity inequality (cp. Corollary 12.2^ : there exists c > 0 such that 


(Bu;,H °‘w) > c(||u;|p 1 
' / - VII 


l^lli2(R;y)) Vu;G77^(M;F)nL2(M;H) . 
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Hence, by Corollary 12.21 and Proposition 12.11 we conclude that the bilinear form in (12.1911 
satisfies the inf-sup conditions (12.3p . ()2.4I1 with X = Y = n L^(M; V). 

2.9. Coercivity over M>. In order to prove the inf-sup condition (12.31) for functions on 

1 

M>, we take an arbitrary w G //) (nL^(M>;H). Then its extension by zero, 

w = Eqw, belongs to (0 according to Proposition 12.81 Similarly, if u G 

(M; i?) nL^(M; V), then its restriction to M>, v = R>v, belongs to (M^; nL^(R>; V) 

according to Proposition 12.61 ('ll. We have the bounds 

( 2 . 20 ) 

( 2 . 21 ) 

Moreover, 


pc 1 

Hi. 


^gj(K>;LI)nL2(R>;y) - ’ 


(R;_H')nL2(R;y) 


(Di»){*) = ' d/_‘^ 


. ^w'(s)ds, t>0 

[t — s) 2w[s) ds = < '• 2 '' 

I 0 , t < 0 


that is, D^E'oU’ = EqD^w. Similarly, 
1 


{BIv){t) = -r^B f {s-t) 2v{s)ds = -r^B f {s -1) 2v{s) 
1 [n) Jt 1 (.ol Jt 


ds , t > 0 


i 1 

that is, ii>D^u = Hence, 


{BlEQW,BIv)Hdt= {EQBlw,BIv)Hdt= / {Blw, R>BIv )h dt 


{Blw,B^_R>v)Hdt . 


If we denote by Hr(-,-) and Hr^(-,-) bilinear forms as in (I2.19p computed over M and 
respectively, then we conclude that 




( 2 . 22 ) 


B^^{w,R:^v) = B^{Eow,v) . 


The inf-sup condition proved in the previous subsection means that for each w € (R; H)r\ 

L^(R; V) there is a u G fl 2 (R; H) n L^(R; V) (namely, v = H““tc) such that 

(2.23) 




> C kc 1 

— " "H^{R;H)nL‘2{R;V) 


For arbitrary w G |q|(R>; H) n L^(M>; V), we let w = Eqw and take v as above and set 
V = i?>u, that is, V = ii>H“"FioU’. Then, by (12.20^ . (I2.2ip . (I2.22p . and (I2.23p . we obtain 
B^^{w,v) ^ Bm,{w,v) 


> 




(R;/I)nL2(R;y) 


> c\\w\ 


H7(R;H)nL2(R;y) 


> c||u;| 


' H^\^^.^{R>-,H)nL^{R>y) 


This is the desired inf-sup condition. 
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3. Linear parabolic evolution equations 

We present a space-time variational formulation of the initial boundary value problem for 
the abstract, linear parabolic evolution equation (11.11) with homogeneous initial condition 
(1121). For the operator A € C{V,V*), we assume (12.61) . In what follows, all Hilbert spaces 
are taken over the coefficient held M. Using the function spaces developed in Section [21 we 
now state the weak form of the linear parabolic initial-value problem (II.1|) . (11.21) : it is based 
on the Bochner spaces 

X = //) n U) ^ ® V ), 

Y = nL2(M>;U) ~ n (L2(M>)(gU) . 

Here, ® signihes the Hilbert tensor product space endowed with the (unique) cross norm. 
The parabolic operator takes the form B = T) + A with the J'Q-distributional derivative D 
introduced in Section 1221 Lemma [27n 

Besides the spaces X and Y in (13.11) . we will also need the space 

(3.2) Z = . 

We shall make use of the following continuity properties of extensions and restrictions which 
follow from Proposition 12.61 and Proposition 12.81 

Proposition 3.1. For X, Y, and Z as in ()3.1I) . (13.2p there holds: 

(1) X C Z with continuous embedding given by the zero extension Eq. 

(2) Y = Ry{Z) with i?> denoting the operator of restriction of elements o/L^(]R;LI) to M>. 

(3) Z* ~ (iL5(M;Lr))* +l2(M;U)* ~ + L ^{ R ; V *). 

(4) Y* is isomorphic to {g ^ Z* ■. supp(5) C M>}. 

(5) X is a dense subset ofY, that is, X^^ = Y. 

From A € C{V,V*) it follows that B := D + A € C{X, U*). More precisely, there holds for 
every v G X, 

Bv = {D + A)v = Dv + Av G {H^Ry, H))* + L‘^{R>-,V*) 

~ {H^{Ry;H)y + L‘^{R>;Vy ~ (l/^ (M>; if) n L2(M>; U))* = U* . 

For any source term f G Y*, we consider the space-time weak formulation of (II.ip . (II.2D : hnd 

(3.3) uGX: Bji+A{u, v) = F{v) Vu € T . 

Here, the linear functional F{-) is dehned by 

F{v) = {f,v) \/vGY 

with (•, •) denoting the Y* x Y duality pairing. The bilinear form is given by, cp. Lemma (2.71 

(3.4) Bi)+a{'w,v) := j UBfw,Bfv)H + a{w,v)\dt , wGX, v gY , 

where X and Y are as in (|3.ip . The form Bd+a(', •) in (13.4p is continuous by Proposition 12.91 

1 1 

(1) and (2), stating that for every w G H) we have lYfw G L^(M>;fi) and that 

1 1 

for every v G ii 2 (M>;ii) we have Dfv G L^(M>;ii). 

The unique solvability of (13.3D was proved in [121 Sect. 4.1] by extension to a problem over 
M, where coercivity in the sense of Corollary 12.21 can be proved, see Subsection 12.81 As a 
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result of the unique solvability of (13.31) we conclude that the inf-sup conditions (12.3j) . (|2.4p 
hold. We formulate this in the following proposition. 

Proposition 3.2. Suppose that assumption (12.6p holds. Then, for the choice ()3.1I) of spaces, 
the bilinear form dsai) satisfies the continuity condition m and the inf-sup conditions 
()2.4p . In particular, for every f £Y* there exists a unique solution u € X of ()3.3p . 

Proof. We observe that Y ~ and that X ~ i?g’J((5_|_) in the notation of [121 

Thm. 4.3, Sect. 4.1] with p = 2. It is shown there that the operator B = D + A £ C{X, Y*) is 
bijective. Therefore, Proposition 12.II implies the inf-sup conditions (12.3p . (12.4p for the bilinear 
form (|3.4p on the spaces X y.Y m. (13.11) . □ 

4. Sparse tensor Galerkin discretization 

Having established the well-posedness and the unique solvability of (jl.ip . (II. 2p we now 
turn to Galerkin approximations. Rather than considering time-stepping (as studied, e.g., 
in [23]), we are interested in compressive space-time Galerkin discretizations, as analyzed for 
the first time in [20]. We present and analyze adaptive, compressive, space-time schemes 
which are based on the weak space-time formulation (|3.3I) . The adaptive, and space-time 
compressive schemes inherit, being instances of the general theory in laE], stability from 
the well-posedness of the infinite-dimensional problem shown in Proposition 13.21 and from 
the stability of the Riesz bases. As in [20], they are based on tensor product constructions 
of Riesz bases of X and Y ; however, the variational formulation obviates the need for 
stability of Riesz bases in negative order Sobolev spaces. We present classes of spline wavelets 
in the time domain and also in the spatial domain D C M”, which we assume to be a polygon 
or polyhedron. Rather than focusing on a particular family of wavelets, we specify several 
axioms from [2D| to be satisfied by the tensorized multiresolution bases in the spatial and 
temporal domains in order for our analysis to apply. We assume that V and H are modeled 
on Sobolev spaces on the bounded Lipschitz polyhedron D C M”, n > 1. As in [5D], our 
analysis accommodates two cases: case m- n = 2, 3 and D is a bounded polyhedron with 
plane faces; and the high-dimensional case ([B|): re > 1 and D = (0,1)”. In D we consider 
general elliptic operators A of order 2m, m > 1. The generic example is A = —A, V = Hq{D), 
and H = Lp‘{D), in which case rre = 1. The domain for the parabolic initial-boundary value 
problem is the space-time cylinder Q> := M> x D. 

4.1. Space-time wavelet Galerkin discretization. The Galerkin discretization of the 
space-time variational formulation (|3.3I) will be based on two dense, nested families 

of subspaces of X and Y as in (|3.ip . The inf-sup condition (|2.3p makes it necessary to 
allow X^ 7 ^ Y^ (leading in effect to Petrov-Galerkin discretizations), so that Proposition 12.11 
is used in full generality. As indicated above, we choose as tensor-products of 

spaces of continuous, piecewise polynomial functions of t G M> and x £ D, m order to 
obtain good (space-time compressive) approximation of solutions, whereas Y^ will be selected 
to ensure good stability. Multiresolution bases will be required to ensure: (a) multilevel 
preconditioning, i.e., all stiffness matrices have (generalized) condition numbers, which are 
bounded independently of t, and (b) matrix and (space-time) solution compression. 

Thus, we consider the Galerkin discretization: to find, for I £ Nq, 

(4.1) £X^ : Bd+a{u^, v^) = F{y^) V/ G Y^ . 
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We assume that 

N(, = dim(X^) = dim(y^) < oo , 

such that CX = (M> ■H)nL‘^ (M> ;V) and C Y = (M> ; H) n L‘^{R>;V) are 

closed and and U^g^^^ are dense in X, respectively in Y. Proposition 12.11 implies 


Proposition 4.1. Assume that the Galerkin discretization 
sense that there exists 7 such that, for all £ G No, 

Br,+A{w,v 


KTh of m\i is stable, in the 


(4.2) 


inf sup 


re X w Y 


> 7 > 0 . 


Then, for every F G Y* and for every £ G N, the Galerkin approximation gH) admits a 
unique solution G X^. In particular, the (in general, non-symmetric) stiffness matrix 
corresponding to gH) is nonsingular. Let u ^ X be the corresponding unique solution to 
O and C be the constant in (EU). Then there holds the quasi-optimality estimate 

(j 

(4.3) \\u — u^\\xY— inf lltt — u^llx ■ 

7 


The proof of Proposition 14.11 is straightforward: existence and uniqueness of in ()4.ip 
and the invertibility of the Ni x matrix follows from ()4.2H with Proposition [2T] The error 
estimate (14.3p follows from the Galerkin orthogonality 

Bii+A{u-u^,v^) = d Vl;^GF^ 

by noting that the error isu — u^ = {I — R^){u — v^), where is the Ritz projector that maps 
u I—>■ u^. Therefore, (14.3p holds with constant ||/ — R^\\c{x,x) = \\R^\\c{x,x) < C'/Tj 12Z]- 
For preconditioning and efficient computation, as well for adaptive space-time Galerkin 
discretizations with optimality properties, the concept of Riesz basis takes a central role. 


4.2. Riesz bases and bi-infinite matrix vector equations. We assume at hand a Riesz 
basis : A G V^} for X. The Riesz basis property amounts to saying that the 

synthesis operator 

: £2(V^) -a X :ceA := ^ cxip^ 

Aev-^ 

is boundedly invertible. Rs adjoint, known as the analysis operator, reads 

: X* -A ^ 2 (V'^) : g 1 -^ [gi'f’x )]agV-^ • 

Similarly, let : A G V^} denote a Riesz basis for Y, with synthesis operator 

and adjoint Ahead, we construct Riesz bases T"’'- and by tensorization of wavelet 

bases in ]R> and in D. 

By Proposition 13.21 R = D + A G C{X,Y*) is boundedly invertible with the choice of 
spaces in m- We may write (j3.3p equivalently as operator equation: given f gY*, find 

(4.4) uGX : Bu = f in Y* . 

Writing u = s,j,xu, (13.31) and (14.41) are equivalent to the bi-in£nite matrix vector problem 

(4.5) Bn = f , 

where f = s'^vf = [/(V^DlAeV^ e ^2 (V^) , and where the “stiffness" or system matrix 
B = s'.vRs^x = [(RO(V’D]agV-,;.GV- € 
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is boundedly invertible. We may write 

Bdj^a : X X y ^ M : {w,v) i-A {Bw){v) , 
and we also nse the notations 


B = ^d+a(^^,^^) and f = /(^ 




With the Riesz constants 




■— ll'Sipv||£(£2(V-^),X) — 


c' 


sup 

o^cg£2(V'^) I|c|h2(v^) 


\x 


\X 

A^x ■— IIS 


= inf 




».vllaA-,4(V.V)) ||c||,,,vx, ’ 

and analogous constants and A^y, the bounded invertibility of B G C{X,Y*) implies 
that the condition number of B is finite, i.e., 

\\^\\c(i 2 (XYMX^)) - \\^\\c{X,Y*)^^x^^y , 

\B~^\\c{y*,x) 


IB 


\cmx^)Mx^)) ^ —xA 


We next construct Riesz bases of the spaces X and Y in (13.11) . 


4.3. Riesz bases in and We assume at our disposal two countable 

collections 0^,0^ C i/^(]R>) of functions such that 

0^ = : A G Vf} C Ro'{o}(^>) 

is a normalized Riesz basis for L^(M>) which, when renormalized in R^(M>), is a Riesz basis 
for Analogously, we assume available 0^ = {6^ : A G V^} C R^(M>), a Riesz 

basis of L^(M>) which, when renormalized in R^(M>), is a Riesz basis for R^(M>). 

From Proposition 12.61 we obtain the following result. 

Proposition 4.2. Assume given two collections 0'’^ and 0^ with the above properties. Then, 
/or 0 < s < 1, the collections [0'’*']s and [0^]s, which are obtained by rescaling 0^ and 0^ by 
{2^l^l : A G VJ, (e.g., [0^], = : A G Vf are Riesz bases of 

and of [L^(M>), R^(R>)]s, respectively. In particular, for s = ^, [0'’^]i is a Riesz basis for 
1 ^ 

Rqq |gj(R>) and [0^] i is a Riesz basis for 

We denote by 6^ elements of the collection 0^ and, likewise, by 9^ elements of 0^. 
Further assumptions on the bases 0^, 0^ are as in |20j : denoting by 9x a generic element in 
either of the collections 0^ and 0^, we require the 9\ to be 

(tl) local: that is, sup^g^^ if{^ ^ l-^l = ^ S supp^x} < oo and Isupp^xl ^ 2“l^l, 

(t2) piecewise polynomial of order dp. here, “piecewise” means that the singular support 
consists of a finite number of points whose number is uniformly bounded with respect 
to |A|, 

(t3) globally continuous: specifically, ||0 a|Ive,^(m>) ^ 2 l'^l*^2+*^) for k G {0,1}, 

(t4) vanishing moments: for |A| > 0, the 9x have dt > dt vanishing moments. 
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Properties (ltT|) - (lt^ are assumed to hold for both 0^ and 0^. We remark that property 
()^ , global continuity, is necessary to ensure H 2 (R> )-conformity, even though H 2 (1R>) is 
not embedded into C'^(M>). 

Properties (ltT])- ()t^ can be satisfied by collections 0^, 0^ that are continuous, piecewise 
polynomial wavelet bases on dyadic refinements of M>, which are of order d* > 1. For /c G Nq 
we denote by the set of A G Vj with refinement level |A| < k. It holds that ~ 2^. 

Setting also v| := 0, we define the biorthogonal projector ;= Q^(k) by 

QKt'^= 


where 0''’^ denotes the dual basis, and analogously for := (^^,y We have 


Pb - Qk,t\\ciH<it{M.y),L^(R^)) ^2 , ||Id - 






and analogously for with H 2 (M>) in place of ,{0} (^>)- 

Constructions of compactly supported spline wavelet systems 0 on (—1,1) and on M, as 
well as direct constructions (i.e., not based on antisymmetry) of Riesz bases 0^ and 0^ on 
M> satisfying properties (ItTD- dMIl with 9^{t)\t=o = 0 are available, for example, in [11I51IUII2B] 
and the references there. 


4.4. Riesz bases in H and V. With H = L?‘{D) and the assumption that V coincides with 
a closed subspace (supporting homogeneous essential boundary conditions) of the Sobolev 
space R'"(IA) for some m > 0, we assume at our disposal a Riesz basis 

S = {(Ta : A G V J C P . 


Specifically, S is a collection of functions that is a normalized Riesz basis for H which, upon 
renormalization in V, is a Riesz basis denoted [S]y for V. Riesz bases of divergence-free 
functions in the context of Example 12.41 are constructed in [221126] and the references there. 
For the spatial wavelet basis S, we consider as in [20], two cases: 

(A) it is a wavelet basis of order dx > m with isotropic supports constructed from a dyadic 
multiresolution analysis in Lp‘{D), 

(B) D = (0,1)"" and S is the tensor product of (possibly different) univariate wavelet bases 
Sj as in (TAl) in each of the coordinate spaces. 

In case m, for some sufficiently large K depending on m, where 2m is the order of A, and 
for some G Nq such that m — I<r 3 ;<d 3 ; — 2 and dx G Nq, we will assume that the a\ are 
(si) local and piecewise smooth: for any £ G Nq there exist collections {Di „ : v G O^} of 
disjoint, uniformly shape regular, open subdomains such that D = [Jx£OeDl,v, is 
the union of some ~ 2“^, suppuA is connected and is the union of 

a uniformly bounded number of D^x\,vj each has non-empty intersection with the 
supports of a uniformly bounded number of a\ with |A| = i, and, for k G {0, AT}, 


Fa|Iw4(D|^i,,) 


< 2l^Kf+^) 


(s2) globally C'"^: specifically, ||<7a||w'' (d) ^ 2l'^l^2+^) for k G {0,ra; -|- 1}, 
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(s3) for |A| > 0, have cancellation properties of order dx- 

j wax < iov k e {0,dx}, w e W^{D) nV . 

(s4) In addition to dsT]) . we assume that for any i and v € Of, there exists a sufficiently 
smooth transformation of coordinates k, with derivatives bounded uniformly in £ and 
V, such that for all |A| = i, {ax o k)|k-i(_d^ „) is a polynomial of some hxed degree. 

For case m, we assume that each of the Sj satisfies the above conditions with {D, n) = 
((0,1), 1). In this case, we assume that the wavelets are piecewise polynomials of order dx, 
with those on positive levels being orthogonal to all polynomials of order dx that are in V. 


Assumption 4.3. The bi-infinite matrices M = and A = a([S]y, [Sjy) for the 

spatial operators in dSH) are s* computable, in the sense that for each N £ 'N, there exist 
approximate matrices Mjv and A^v with at most N non-zero entries in each column and such 
that, for every 0 < s < s*, the expressions 

sup A||M — , sup A||A — AatII^/^ 

TVeN NeN 

are finite. Here, || • || denotes the spectral norm. 

A number of practically viable constructions of Riesz bases S, which satisfy Assumption |431 
for several classes of operators A £ C{V,V*) have become available in recent years: for 
example, for second order, elliptic divergence form differential operators A, and also for self- 
adjoint, integro-differential operators A of fractional order (in which case V coincides with 
the domain of Az); also tensorized S for diffusions on D = (0,1)” have become available, 
which satisfy Assumption 14.31 We refer to [20l Sect. 8.3] for this. For 0 < s < 1, we denote 
by [Sjs the Riesz basis S rescaled to [H,V]s. 

4.5. Riesz bases in X and Y. We assume that we have at our disposal Riesz bases 0^ = 
{0^ : A e V^}, 0^ = : A £ V^} of L^(M>) for which rescaling renders 0^ a Riesz 

basis of and 0^ a Riesz basis of FI^(M>). The bases [0^]i and [0^]i are then 

defined as in Proposition 14.21 In the spatial domain D, we assume available a Riesz basis 
S = {ax '■ A £ Vj;} of H which, when rescaled to V, becomes a Riesz basis [Sjy for V: 
P]v = {aA/||uA||R : Ae VJ. 

Proposition 4.4. Given Riesz bases 0^, 0^ and S o/L^(M>) and H, respectively, as above, 
the collections := 0^’^-(8)S , ;= 0^(8) S , are Riesz bases of L‘^{'S.y-, H) ~ L^(]R>) (8)i4. 

Moreover, the collection 


: = 




{t, x) !->• 


: (A,/i) G := Vf x V, 




H. 


00 ,{ 0 }'' 


is a Riesz basis for X = R) n V), and the collection 


{t, x) i-A 




a 


uWv 




:{X,ti)£S£^ :=VY x V, 


: = 
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is a Riesz basis for Y = (M>; H) n V). 

The Riesz constants for and depend only on the respective Riesz constants for 0^, 
[0^]i, 0^, [0^]i and for Y, [S]y. 

Proof. The Riesz basis property for follows from our assumptions on 0 and S, the result 
13 Prop. 1, Prop. 2] on tensor products of Riesz bases and from Proposition 14.21 □ 

4.6. Space-time compressible approximation rates of smooth solutions. Using the 
tensor product Riesz bases of X in Proposition 14.41 in a Petrov-Galerkin discretization 
(14.ip of the space-time variational formulation (|3.3p allows for space-time compressive approx¬ 
imations of smooth solutions, provided test function spaces are available which are stable, 
i.e., which satisfy ()4.2I) . The approximate solutions thus obtained will be quasi-optimal. Such 
stable test spaces can be constructed on the basis of the coercivity property in Subsection l2.9l 
However, we shall not develop this here but refer to [lOl Chapt. 5]. Likewise, in the adap¬ 
tive setting, sequences of approximate solutions are produced, which converge at best possible 
rates, when compared to best A^-term approximations of the solution. We therefore exemplify 
the best possible approximation rates in X which can be achieved in terms of the parameters 
dt and dx- 


4.6.1. Best rate in case dAD. For any A C V^,, let Q\ : L‘^{D) span(0A : A € A) denote the 
L^(Il)-biorthogonal projector associated to S and A. The assumption of S being of order dx 
means that, with being the set of A € V^; with refinement level |A| < /c € Nq, it holds 
that ffV^x'^ ^ 2^"'. Setting vl := 0, we obtain for the projector Qkx '■= Q.^(k) that 

||Id - Qfc,x||£(//d^(D)nv,v) ~ 2 — Qk,x\\c{H‘i^(D)nv,H) ■ 

In case dt < with i/k G + e,^ — e] for (small) e > 0, we have 


k i 

Id ^ ^ ^ ® {Qq,x 

p=0 q=0 


Qn- 


1.x ) 


^ 2—kdt 


Ylp=oTll=o(Qp.t ~ Qp-i,t) G {Qq,x - Qq-i,x) is the L2(L))-biorthogonal projector asso¬ 
ciated to the tensor product basis '!' = 0 (g) S and the “sparse” tensor-product index set 

Aa := U^=o X , 

which satisfies if {A a) ^ 2^, see [T4] . 

In view of the approximation orders of the bases being applied, and the tensor product 
1 

structure of Al = ,{0} ; H) n V), by interpolation we obtain the rate 

M . / 7 1 cix tn \ 1 


with e > 0 arbitrarily small due to the appearance of logarithmic factors. This rate is best 
possible for functions which are smooth with respect to x and t, and for Riesz bases S with 
isotropic supports in D as are admitted in case 
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4.6.2. Best rate in case ©• Throughout the discussion of case we assume n > 2 (the 
case n = 1 being a particular instance of dA])). For 1 < i < n, let 14 be either 1) or 

a closed snbspace incorporating essential boundary conditions. Let Sj = : Xi G Vj} be 

a normalized Riesz basis for Hi := L^(0,1), that renormalized in 14 is a Riesz basis for I 4 . 
For any Aj C Vj we denote by Q\^ : L^(0,1) ^ span(0;^ : A G Vj) the L^(0, l)-biorthogonal 
projectors associated to Sj and A*. The assumption of Sj consisting of continnons, piecewise 
polynomial fnnctions of order dx means that, with = {A G Vj : |A| < A: G No}, on any 
hnite snbinterval (0, T) C M> it holds that ^ 2^ (with the constant implied in ^ being 

0{T)). With the convention V^- := 0, and = 0, we have for Qk^i := Q^{k) that 

||Id — (5fc,i||/:(Hd„(o,i)nVi,Vi) ~ Pd — <3fc,i||/:(Hda:(o,i)nVi,i7i) ~ 2 . 

The collection S := = {ax := : A G V^, := 0?=! ^ normalized Riesz 

basis for L?‘{D). Rescaling this basis in 


F := n(Li 0^=1 IT*, 


where ITj,- := 



when j ^ i , 
when j = i , 


it is a Riesz basis for V as well. 

Recall that for any A C V^,, Qa denotes the L^(L))-biorthogonal projector associated to S 
and A. As shown in there exist “optimized” sparse product sets C C ■ • • C 

Vx and C C • • • C V^, with ^ 2^ ~ such that with Qk x ■= and 

with Qk^x := Qyik), and 


:= n(Li , where Z*,- 


Hj , when j ^ i , 

(0,1) n F ) when j = i , 


it holds that 


\\ld — Qk^x\\c{H<^^(D),V) Pd — Qfc,a:||£('H‘^2:(E)),_f7) ~ 2 . 

Choosing as index set Ab the union of sparse products of the index sets (v|^^)o<p<A: with 
(vi?^)o<q<£ or iy^x^)o<q<e. for suitable k and i, we obtain L^(]R> x D)-biorthogonal projectors 
associated to Xah C X = c1osa:(0®S) that, for u G ni/Q|Q|)(M>)(g)'R'^'“(L)), with a set 

of at most N basis fnnctions give rise to an error in |q|(M>; V) of order 

for s = 0,1. Interpolation between L^(M>) and LAg |g|(M>) results, by Proposition 12.61 in the 

norm of the Bodmer space X ~ LIgg ig) T in the (best possible, for smooth functions) 

rate 

(4.7) mm{dt - ^,dx - m) . 


Summarizing (14.6p and (14.7p . for solutions which are smooth functions of space and time, the 
rate 


(4.8) 


•^max 


min((it — i, — e in case ([X]) , 

min((it — ^,dx — m) in case ([B]) . 


is realized with the index sets Aa, Ab C V^. 



18 


S. LARSSON AND CH. SCHWAB 


5. Adaptivity 

The sparse tensor space-time Galerkin discretization gH) based on the a priori choices , 
XAg of sparse tensor product trial spaces and the corresponding testfunction spaces Ia^, Iab 
lead to quasi-optimal approximations; the quality of the Galerkin approximation thus being 
determined by the best approximation property. Alternatively, following [3 [13], (sequences 
of) subspaces = X^x C X and C Y may be selected adaptively, with sequences 

{^A;}a;>o C Vt X of sets of “active” basis elements 0AG)cr^G'k = 0(8>S determined so 
as to ensure optimality properties of the corresponding Galerkin approximations Uax for the 
given set of data. In doing this, a key role is played by the (approximate) computability of 
(finite sections of) the bi-infinite matrix B defined by 

(5.1) B=((Dj[eqp,Dhe''ip)y„^i®(s,E)„+(0yeq,^,„^,®a(p]„,[E]„). 

We recapitulate basic properties of adaptive wavelet-Galerkin methods, in particular, the 
notions of admissibility and computability of the corresponding discretized operators; our 
presentation will be synoptic, and we refer readers who are unfamiliar with these to 1211 
[20|. We will, in particular, review the notions of s-admissibility, s-computability and s- 
compressibility of Galerkin matrices of operators. Finally, we obtain an optimality result for 
the adaptive wavelet Galerkin discretization of the space-time variational formulation (j3.3p : 
the sequence of Galerkin solutions produced by the adaptive scheme is optimal in the norm of 
X with respect to the best A-term approximation of the solution in space-time tensor product 
wavelet bases; thereby offering the first result on optimality for a nonlinear and compressive 
algorithm for long-time parabolic evolution problems. This is distinct from laiiQi, where the 
constants in the error and complexity estimates depend on the length of the time interval. 

5.1. Nonlinear approximation. Nonlinear approximations to u € A are obtained from its 

coefficient vector u by best A-term approximations utv. These vectors, with supports of size 
A € No, encode the A largest coefficients in modulus of u. For s > 0, the approximation 
class A^(£ 2 (V^)) := {v € ■ ||v|U^(£ 2 (vx)) < oo}, where 

I|v|Uj^(£ 2 (V^)) := supd X [min{A e Nq : ||v - ^ 

(5>0 

contains all v whose best A-term approximations converge to v with rate s. 

Since best A-term approximations involve searching the entire vector v, they cannot be 
realized in practice. In addition, for a solution u € A of the PDF (11.11) . the vector u to be 
approximated is not explicitly available. It is only given implicitly via (11.11) . (11.21) through the 
(equivalent) bi-infinite matrix vector problem (14.5p with respect to some Riesz basis Our 
aim is to construct a practical method that produces approximations to u which, whenever 
u G A^{i 2 {Y^)) for some s > 0, converge with this rate s in linear computational complexity. 

5.2. Adaptive Galerkin methods. Let s > 0 be such that u G In [6] and the 

references there, adaptive wavelet Galerkin methods for solving (|4.5p were introduced. These 
methods are iterative methods which address the non-elliptic nature of the operator dni) by 
iterating, instead of (|5.ip . on the associated normal equations, i.e., on the linear system 

(5.2) B*Bu = B*f . 

Key ingredients in the estimates of their complexity are asymptotic cost bounds for approxi¬ 
mate matrix-vector products in terms of the prescribed tolerance e. 
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Definition 5.1. (s*-admissibility) B € £(^ 2 (V^),£ 2 (V^)) is s*-admissible if there exists a 
routine 

APPLYB[w,e] ^ z 

which yields, for any e > 0 and any finitely supported w G ^ 2 (V^), a finitely supported 
z G ■^ 2 (V^) with ||Bw — z||£ 2 (vi') ^ ^ for which, for any s G (0,s*), there exists an 

admissibility constant g such that ^suppz < o,nd the number of 

arithmetic operations and storage locations used by the call APPLYB[w,e] is bounded by 
some absolute multiple of 

#supp w + 1. 

One key step in adaptive wavelet methods for (li3D is thus the construction of a valid 
routine APPLYB[w,e] for the bi-infinite matrices B defined in ()5.1h . 

In order to approximate u one should be able to approximate f. Throughout what follows, 
we therefore assume availability of the following routine. 

RHSf [e] —>■ fe : For given e > 0, it yields a finitely supported G ■^2(V^) with 

l|f - < e ond #suppf£ < min{A : ||f - fAr|| < e}, 

with the number of arithmetic operations and storage locations used by the call RHSf[e] 
bounded by some absolute multiple o/^^suppfe -|- 1. 

The availability of APPLYb and RHSf implies the following result. 

Proposition 5.2. Let B in (14.5p he s*-admissible. Then for any s G (0,s*), we have 
l|B||£(^|^(£ 2 (v^)),^i,(£ 2 (vy))) < a^B,s- Porz, := APPLYB[w,e], there holds < 

aB,5l|w|U|^(£2(V^)). 

For proofs, we refer to [6] or m Prop. 3.3]. Using the definition of A^(.^ 2 (V^)) and the 
properties of RHSf, we have 

Corollary 5.3. If, in (14.5p . B is s*-admissible and u G A^(£ 2 (V'’*')) for s < s*, then for 
fg = RHSf[e], ^suppfe < aB,se~^^^||u ||^/with the number of arithmetic operations 
and storage locations used by the call RHSf[e] being bounded by some absolute multiple of 

Remark 5.4. Besides jjf—fe||f 2 (v’^) ^ complexity bounds in Corolla, rv \5.,I\ with ob ,S ^ 0 

being independent of e are essential for the use of RHSf in the adaptive wavelet methods. 

The following corollary of Proposition 15.21 can be used for example for the construction of 
valid APPLY and RHS routines in case the adaptive wavelet algorithms are applied to a 
preconditioned system. 

Corollary 5.5. //B G C G T(£ 2 (V^), .£ 2 (V'^)) are both s*-admissible, 

then so is CB G T(.£ 2 (V^),^ 2 (V^)). A valid routine APPLYqb is 

(5.3) [w,e] APPLYc[APPLYB[w,e/(2||C||)],e/2] , 

with admissibility constant ucB.s ^ aB,s(||C||^/^ -|- ac,s) for s G (0, s*). 

For some s* > s, let C G £(.^ 2 (V^),£ 2 (V^)) be s*-admissible. Then for 

(5.4) RHScf[£] := APPLYc[RHSf[e/(2||C||)],e/2] , 
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there holds 


#suppRHScf[e] < aB,s(||C||^ 
||Cf-RHScf[e]||,,(v^) <c, 




^Sc(^2(VX)) 


with the number of arithmetic operations and storage locations used by the call RHScf[e] 
bounded by a multiple of 

Remark 5.6. RHScf allows to approximate Cf in the sense of Remark \5.4\ 

Consider first the case that B is self-adjoint positive definite, i.e., and B = B* > 

0. In this case the adaptive wavelet methods from [6] are optimal in the following sense. 


Theorem 5.7. (l6l[T3]J If in B is self-adjoint positive definite and s*-admissible, then 
for any e > 0, the adaptive wavelet method from [6] produces an approximation to u with 
||u — U£||£ 2 (v 2 f) < £• If (|4.5H for some s > Q it holds u € then ^suppu^ < 

and if, in addition, s < s*, the number of arithmetic operations and 
storage locations required by one call of either of these adaptive wavelet solvers with tolerance 
£ is bounded by a multiple of 


£ + aB,5)||u||y/ 


+ 1 . 


d^2(V-^)) 

The factor depends only on s when it tends to 0 or oo, and on ||B|| and ||B' 


The adaptive Galerkin discretization method from [5] for self-adjoint operators B consists 
of the application of a damped Richardson iteration to Bu = f, where the required residual 
computations are approximated using calls of APPLYb and RHSf within tolerances that 
decrease linearly with the iteration counter. 

With the method from [5], a sequence Hq C Hi C • • • C is produced, together with 
corresponding (approximate) Galerkin solutions u* € £ 2 (^ 1 )■ The coefficients of approximate 
residuals f — Buj are used as indicators how to expand Hj to Hj+i such that it gives rise to 
an improved Galerkin approximation. 

The method of [S] relies on a recurrent coarsening of the approximation vectors, where small 
coefficients are removed to maintain optimal balance between accuracy and support length. 
We have s*-admissibility of B once the stiffness matrix with respect to suitable wavelet bases 
is close to a computable sparse matrix. The next definition makes this precise. 

Definition 5.8. (s*-computability) B G C{£ 2 (V^),i 2 (S/^)) is s*-computable if, for each 
N € N, there exists a B^r G C{£ 2 {V^),£ 2 {V^)) having in each column at most N non-zero 
entries whose joint computation takes an absolute multiple of N operations, such that the 
computability constants 

are finite for any s G (0, s*). 


Theorem 5.9. An s*-computable B is s*-admissible. Moreover, for s < s*, aB,s ^ cb,i 
where the constant in this estimate depends only on s 0, s j' s*, and on ||B|| —>■ 00 . 
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This theorem is proven by the construction of a suitable APPLY b routine as was done in 
[5l §6.4], see also [2T] and the references there. 

The non-elliptic nature of B was addressed in [6] by applying the adaptive schemes to the 
normal equations (15.2p : From Subsection 14.21 we deduce that B*B G T(^ 2 (V^),.^ 2 (V^)) is 
boundedly invertible, self-adjoint positive definite, with 

I|B*B||£(^2(V'^)A(V-^)) < l|B|l£(£2(v-^),^2(vy)) ’ 

||(B*B) - II® ^ll/:(£2(vi^),£2(V'^)) ■ 

Now let u G A^(£ 2 (V^)), and assume that for some s* > s, both B and B* are s*-admissible. 
By Corollary 15.51 with B* in place of C, a valid RHSb*! routine is given by (15.4p . and B*B is 
s*-admissible with a valid APPLYb*b routine given by (15.3p . A combination of Theorem 15.71 
and Corollary 15.51 yields the following result. 

Theorem 5.10. For any e > 0, the adaptive wavelet methods from applied to the normal 
equations ()5.2p using above APPLYb*b and RHSB*f routines produce approximations 
to u which satisfy jju —Ue||^ 2 (v^) — Ufor some s > 0, u G A^(£ 2 (V^)), then ^suppu^ < 

with constant only dependent on s when it tends to 0 or oo, and on ||B|| 
and ||B~^|| when they tend to infinity. 

If s < s*, then the number of arithmetic operations and storage locations required by a call 
of either of these adaptive wavelet methods with tolerance e > 0 is bounded by some multiple 
of 

1 + £ + a-B,s{I + ®B*,s))||u||]J//(£2(V^)) 

where this multiple only depends on s when it tends to 0 or oo, and on ||B|| and ||B~^|| when 
they tend to infinity. 

5.3. s*-computability of B in (15.ip . We apply the general concepts to the space-time 
variational formulation (13.3p and the space-time tensor-product wavelet bases = 0^ ® S 
and = 0^ (g) S in Proposition 14.41 

Due to the discussion in Section [4.61 it suffices to show s*-admissibility of both, B and B*, 
for s* > Smax with Smax as defined in (14.8p . The bi-infinite matrix B defined in (15.ip comprises 
of a sum of tensor products of bi-infinite matrices, each factor matrix corresponding to either 
the Gram matrices {&>,&>)or (S, or of the “stiffness” matrices a([S]y, [Sly) 

with respect to the Riesz bases 0> and S (cp. Section [43]) . 

To apply the general theory of adaptive wavelet discretizations of laisiEi], the key step is 
the verification of s*-compressibility and of s*-computability of the matrix B in (15.IK . 

We verify s*-computability of B in (15.ID with the following result [20l Prop. 8.1]. 

Proposition 5.11. Let for some s* > 0, D, E be s*-computable. Then 

(a) D (g) E is s*-computable with computability constant satisfying, for 0 < s < s < s*, 
CD®E,s < (cd, 5 Ce,s)^/^ and 

(b) for any e G (0, s*), D (g) E is {s* — e)- computable, with computability constant CD(giE,s 
satisfying, for 0<s<s* — £<s<s*, CD(g)E,s ^ max(cD,s, 1) max(cE,s, 1). 

The constants implicit by < in the bounds on the computability constants in (jaj) and 
depend only on s, s ^ oo and on s — s 4 0. 
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We recall that we work under Assumption I4..SI so that the bi-infinite mass matrix M = 
(S, and the bi-infinite stiffness matrix A = a([S]y, [S]y) are both s* computable and 

compressible under our assumptions (|sT]) - (is^ . 

5.4. s*-computability of the fractional time derivatives. Proposition ETT] and Assump¬ 
tion 14.31 reduce the analysis of s*-compressibility of B in (|5.1I) to the verification of the s*- 
compressibility of the temporal “stiffness” and “mass” matrices 

(5.5) D:=(Di|0^1.,D!|0ni>i=,»,,. G := (0^,, 

i.e., on the compressibility of the “stiffness” matrix D and of the “mass”-matrix G of the 
fractional time derivative in (13. 4p . 

We discuss s*-computability of D and G in the sense of Definition 15.81 We assume at 

1 

our disposal Riesz bases 0^ of ,{0}(^>) and 0^ of as in Section 14^ and, in 

particular, that properties (jtT])-(jtl]) of that section hold for elements of either of these bases. 

The s*-computability of G follows as in [201 Sect. 8.2] from the properties (|tT])^(jt31) of 0^ 
and 0^. It remains to address s*-computability of D in (|5.5p . 

To this end, we observe that by a density argument, Lemma 12.71 and, in particular, the 

_ 1 

fractional inteerration bv Darts identity (12.131) remain valid for w G ,{0} (^>) and for v € 

772 (M>). Since 0^ is a Riesz basis of |q|(]R>) and 0^ of 772 (K>), we obtain from (12.131) 
that 

d = <d|[0'O4.d![0’'1i 4.(.>) = (Dps'll. 

Now using properties (ltTp - (lt4p of the temporal wavelet bases 0^ and 0^, we establish s*- 
computability of D as in [20l Sect. 8.2]. 


5.5. Optimality. The preceding considerations can be combined into 


Theorem 5.12. Consider the parabolie problem (HH), dLl in the weak form (I33D with 
spatial bilinear form as in Section IH.Hl Consider its representation Bu = f using temporal 
and spatial wavelet bases 0 and S as above. 

Then for any e > 0, the adaptive wavelet methods from |6] applied to the normal equations 
(15.2p produce an approximation with 


]|u — uj [0 (g) S]]|x ||u — Uell < e . 

If for some s > 0, u € A^(72(V^)), then suppug < with the implied 

constant only dependent on s when it tends to 0 or oo. 

If, for arbitrary s* > 0, it holds s < s*, then the number of operations and storage locations 
required by one call of the space-time adaptive algorithm with tolerance e > 0 is bounded by 
some absolute multiple of 




l/s 

. 4^0 (^2 ( V ^)) 


+ 1 . 


Here, the implied eonstant depends only on the Riesz and the admissibility constants of the 
spatial wavelet bases S. 
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